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ABSTRACT 


Increasing  demand  for  information  system  capacity  has  prompted  researchers 
to  find  ways  to  improve  the  computer  systems  used  in  information  processing. 
Database  management  systems  (DBMS's)  represent  one  such  effort  to  provide  bet¬ 
ter  information  services  at  lower  costs.  In  order  to  minimize  response  time 
and  maximize  throughput,  it  is  desirable  that  a  DBMS  supports  multiple  users 
at  the  same  time,  allowing  multiple  transactions  to  run  in  parallel.  However, 
for  the  purpose  of  maintaining  database  consistency  and  integrity,  such 
parallelism  must  be  properly  controlled.  For  example,  suppose  two  trans¬ 
actions  that  transfer  money  into  the  same  bank  account  are  run  in  parallel. 
Without  proper  coordination  between  the  two  transactions,  it  is  possible  that 
both  of  them  would  read  the  same  old  balance,  modify  it  independently  and 
write  these  independently-modified  balances  back  to  the  database.  If  this 
happens,  the  final  balance  will  reflect  the  result  of  only  one,  but  not  both, 
of  the  transactions,  causing  one  transaction  to  be  lost.  To  prevent  such  vio¬ 
lation  of  database  consistency  and  integrity  from  taking  place,  the 
concurrency  control  facility  is  an  indispensable  component  of  the  database 
management  system. 

A  generally  accepted  criterion  for  correctness  of  a  concurrency  control 
algorithm  is  the  criterion  of  AZKZaJLizoibility  of  transactions.  The  classical 
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approaches  to  enforcing  serializability  are  the  two'phaie  locking  technique 
and  the  time.ita.mp  Ohdening  technique.  The  first  technique  ensures 
serializability  by  imposing  a  partial  order  on  all  transactions  based  on  their 
commit  order,  while  the  second  on  their  initiation  order.  Either  approach 
requires  that  a  read  operation  from  a  transaction  be  AtgiiteKCd  (in  the  form 
of  either  a  read  timestamp  or  a  read  lock),  so  that  a  write  operation  from  a 
concurrent  transaction  will  not  interfere  improperly  with  the  read  operation. 
However,  setting  a  lock  or  leaving  a  timestamp  with  a  data  element  is  an 
expensive  operation.  The  purpose  of  the  current  research  is  to  seek  ways  to 
reduce  the  overhead  of  synchronizing  certain  types  of  read  accesses,  and  at 
the  same  time  achieving  the  goal  of  serializability. 

To  this  end,  a  hierarchical  structure  is  proposed  here  as  the  means  for 
analyzing  opportunities  of  reducing  concurrency  overhead  in  a  database  appli¬ 
cation.  A  theorem  is  discovered  which  indicates  that  in  a  hierarhically 
decomposed  database  one  can  construct  yet  a  third  type  of  partial  order  of  all 
transactions  which  is  different  from  the  simple  commit  order  and  the  simple 
initiation  order  that  have  been  used  in  the  existing  algorithms.  This  new 
type  of  partial  order,  called  the  activity  link  otideti,  enables  a  new  algorithm 
for  concurrency  control  to  be  devised  which  is  believed  to  have  the  ability  of 
effectively  reducing  the  overhead  of  read  access  synchronization  in  a  database 
system  that  endorses  a  hierarchical  decomposition  of  the  database. 
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1.0  INTRODUCTION  AND  LITERATURE  OVERVIEW 

1.1  INTRODUCTION  TO  THE  CONCURRENCY  CONTROL  PROBLEM 

Increasing  demand  for  information  system  capacity  has  prompted  researchers 
to  find  ways  to  improve  the  computer  systems  used  in  information  processing. 
Database  management  systems  (DBMS's)  represent  one  such  effort  to  provide  bet¬ 
ter  information  services  at  lower  costs.  In  order  to  minimize  response  time 
and  maximize  throughput,  it  is  desirable  that  a  DBMS  supports  multiple  users 
at  the  same  time,  allowing  multiple  transactions  to  run  in  parallel.  However, 
for  the  purpose  of  maintaining  database  consistency  and  integrity,  such 
parallelism  must  be  properly  controlled.  Therefore  the  concurrency  control 
facility  is  an  indispensable  component  of  the  database  management  system. 

» 

The  role  of  a  concurrency  control  mechanism  is  to  preserve  the  atomicity  of 
a  user  transaction,  i.e.,  it  will  prevent  the  processing  of  a  transaction  from 
being  ZM.onzou6iy  int£Kte.OL\jzd  with  other  concurrenct  transactions,  so  that 
each  transaction  sees  a  consistent  database  state  and,  if  necessary,  can  be 
recovered  or  backed  out  as  a  single  unit. 

A  typical  example  of  database  inconsistency  induced  by  improper  interleav¬ 
ing  of  steps  of  concurrenct  transact_ons  is  shown  •*  n  Figure  1.  As  shown  in 
the  figure,  two  transactions  ^imultaner usly  accessing  the  same  piece  of  data 
may  result  in  lost  update,  leaving  the  database  in  an  incorrect  state.  This 
is  due  to  the  fact  that  the  database  access  steps  from  these  two  transactions 
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Currently  there  is  $100  in  Smith's  account. 
t1 :  Deposit  $50  in  Smith's  account. 

Withdraw  $50  from  Smith's  account. 

Schedule  of  steps  of  t]  and  t2 

t,  reads  Smith's  balance 
t2  reads  Smith's  balance 
t^  computes  new  balance  =  $150 
t2  computes  new  balance  =  $50 
t,  writes  new  balance 
t2  writes  new  balance 


Result  of  Smith's  account 

$100 

$100 

$100 

$100 

$150 

$50 


Figure  1.  An  example  of  database  inconsistency  induced  by  concurrent 


processing. 


are  not  properly  interleaved.  By  requiring  that  the  database  management  sys¬ 
tem  exercise  control  over  such  interleaving  of  concurrent  transactions,  the 
undesirable  effects  can  be  eliminated. 

1.2  THE  SCOPE  OF  CURRENT  RESEARCH 

A  generally  accepted  criterion  for  correctness  of  a  concurrency  control  algo¬ 
rithm  is  the  criterion  of  serializability  of  transactions,  which  means  that 
interleaving  is  harmless  so  long  as  one  can  show  that  the  net  effect  of  such 
interleaving  is  eqi UvaJLznt  to  iomz  6ZKA.aJU.zzd  pK0CZ66tng  (i.e.,  one  after 
another)  of  all  the  transactions  involved.  In  the  above  example,  it  is  appar¬ 


ent  that  the  steps  of  the  two  transactions  are  scheduled  in  such  a  way  tha 
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there  exists  no  serialized  schedule  {i.e.,  either  t,  after  t2  or  t2  after  t ,) 
that  would  have  generated  the  same  net  effect.  Therefore  the  schedule  of 
these  steps  is  not  serializable,  and  therefore  incorrect. 

The  classical  approach  to  enforcing  serializability  is  the  two-phaAZ  lock¬ 
ing  technique.  This  technique  locks  up  data  elements  being  accessed  by  one 
transaction  and  blocks  other  transactions  from  operating  on  these  data  ele¬ 
ments  until  the  first  transaction  is  finished.  Another  .  ;ioach  to  dealing 
with  this  problem  is  that  of  the  timestamp  OAdzrUng  tzch  l,  which  stamps 
the  data  elements  with  the  timestamps  of  the  transactions  . have  operated 
on  them  so  as  to  prevent  violation  of  a  pre-determineo  order  from  taking 
place. 

Either  approach  requires  that  a  read  operation  from  a  transaction  be  AZgiA' 
tZAZd  (in  the  form  of  either  a  read  timestamp  or  a  read  lock) ,  so  that  a  write 
operation  from  a  concurrent  transaction  will  not  interfere  improperly  with  the 
read  operation.  Setting  a  lock  or  leaving  a  timestamp  with  a  data  element  is 
an  expensive  operation.  It  not  only  incurs  an  write  operation  in  the  database 
(in  the  form  of  setting  the  read  lock  or  writing  the  timestamp) ,  but  also 
potentially  causes  delays  for  concurrent  transactions. 

The  purpose  of  the  current  research  is  to  seek  ways  to  reduce  overhead  of 
synchronizing  certain  types  of  read  accesses,  and  at  the  same  time  achieving 
the  goal  of  serializability.  A  more  comprehensive  overview  will  be  giver,  ir. 
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the  next  subsection  of  the  literature  in  concurrency  control,  including 
efforts  up  to  now  that  have  aimed  at  similar  goals.  Here  we  motivate  this 
research  further  through  the  following  simple  example. 

1.2.1  A  MOTIVATING  EXAMPLE 


Consider  an  inventory  database  application  of  a  retail  business  with  a 
database  shown  in  Figure  2.  There  are  several  types  of  transactions  that 
operate  on  this  database.  A  type  1  transaction  inserts  a  sales, 
sales-modif ication ,  or  a  merchandise-arrival  record  into  the  database  when  the 
event  of  a  sales,  sales  modification  or  arrival  of  certain  merchandise  occurs. 
A  type  2  transaction  is  generated  periodically  for  each  item  in  the  inventory 
to  compute  the  current  inventory  level  of  that  item.  This  transaction  visits 
the  sales,  sales-modification  and  merchandise-arrival  records  to  compute  the 
net  change  ever  since  the  last  computation  of  the  inventory  level  of  that 
item.  A  new  inventory  level  for  that  item  is  then  posted  in  the  inventory 
record . 

To  control  these  transactions  using  two-phase  locking,  a  type  2  transaction 
would  have  set  read  locks  on  every  read  access  it  has  generated  to  the  sales, 
sales-modification  and  merchandise-arr ' val  records.  Using  the  timestamp 
ordering  approch,  the  type  2  transaction  would  have  left  read  timestamp  for 
every  such  record  it  has  read.  However,  a  closer  look  at  the  application 
reveals  that  this  overhead  of  read-locking  or  read-timestamping  is  not  neces- 


Merc hand ise-Arriva  | 

(  Order# (Qty  [Arrival 

Figure  2.  An  example  inventory  database. 


sary.  Tne  sales  records,  for  example,  once  committed,  will  not  be  modified. 
In  other  words,  with  the  exception  of  being  deleted  for  garbage  collection 
purposes,  the  sales  records,  once  committed,  have  become  'read-only'  records. 
Therefore  the  function  served  by  setting  read  locks,  etc.,  is  no  longer 
needed. 


While  the  above  case  is  very  simple,  a  more  complicated  case  may  confuse 
the  situation  and  make  it  not  so  obvious  as  to  whether  or  not  the  read 
accesses  should  be  registered.  Suppose  there  are  type  3  transactions  which 
are  also  generated  periodically  to  check  for  the  need  of  reordering  certain 
merchandise  items.  This  type  of  transaction  reads  the  merchandise-arrival 
records  and  adjust  the  merchandise-on-order  records  (by  setting  the 
arrival-date  field  of  such  records.)  It  then  computes  a  gross  inventory  level 
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by  summing  the  current-inventory-level  and  the  quantities  indicated  by  the 
non-arrived  merchandise-on-order  records.  Based  on  this  gross  inventory  level 
a  decision  is  made  as  to  whether  to  reorder  this  item.  If  it  decides  to  dc 
so,  an  order  request  is  printed  and  a  merchandise-on-order  record  is  generated 
and  inserted. 

Under  this  situation,  whether  or  not  type  3  transactions  can  avoid  setting 
read  locks  or  read  timestamps  for  the  inventory  and  merchandise-arrival 
records  it  retrieves  is  not  so  obvious.  In  fact,  if  accesses  to  such  records 
by  type  3  transactions  are  not  controlled,  one  can  construct  situations  in 
which  serializability  is  violated.  Suppose  two-phase  locking  is  used.  Then 
we  can  construct  a  case  in  which  a  merchandise-arrival  record  y  for  item  x  is 
seen  by  a  type  2  transaction  to  compute  the  current  level  of  inventory.  This 
current  level  of  inventory  is  in  turn  seen  by  a  type  3  transaction  to  figure 
out  whether  to  reorder.  However,  this  type  3  transaction  may  not  have  seen 
the  merchandise-arrival  record  y.  As  shown  in  Figure  3,  a  timing  of  these 
three  transactions  can  be  found  such  that  if  the  type  3  transaction  does  not 
set  read  locks,  violation  of  serializability  occurs.  This  violation  would 
generate  adverse  effect  on  the  performance  and  integrity  of  the  database,  and 
is  clearly  undesirable.  A  similar  case  is  constructed  in  Figure  4  which  shows 
that  if  the  timestamp  ordering  technique  is  used  and  if  the  type  3  transaction 
does  not  leave  read  timestamps,  violation  of  serializability  is  also  possible. 
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starts  |  y 

t3  finishes 
reading  all 
merchandise- 
arrival  records 


Starts 
*  1 1 

commits 


t3  reads  current-level 
of  inventory 
(therefore  reads 
the  one  which  has 
included  y . ) 


time 


Figure  3.  If  read  locks  are  not  used,  an  anomoly  may  occur. 


Because  of  the  seeming  unpredictability  of  the  situation,  most  concurrency 
control  algorithms,  for  simplicity,  choose  to  ignore  such  opprotunity  for 
reducing  synchronization  overhead,  and  complies  strictly  to  the  assumption 
that  all  transactions  may  read  and  write  any  part  of  the  database  and  there¬ 
fore  every  access  has  to  be  controlled. 

1.2.2  THE  GOAL  OF  THE  CURRENT  RESERARCH 

The  goal  of  the  current  research  is  to  seek  for  a  iy^tzmatic.  zxploitcution  of 
the  kind  of  opportunity  for  reducing  concurrency  control  overhead  indicated  ir. 
the  case  above. 

Theoretically  speaking,  the  above  case  would  not  exist  if  all  the  derived 
values  of  the  entire  database  were  computed  at  the  time  of  the  entry  of  the 
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Figure  4.  If  read  timestamps  are  not  used,  an  anomoly  may  occur. 


event.  However,  it  is  our  belief  that  such  case  do  exist  widely  an  the 
real-life  applications  of  database  systems,  because  the  instant  computation  of 
all  the  derived  values  at  the  time  when  an  event  is  captured  is  both  costly 
and  unnecessary,  and  does  not  comply  with  how  the  organizations  that  use  a 
database  system  are  accustomed  to  operate.  Delayed  computations  also  supports 
certain  method  of  operation  that  an  organization  may  have  purposely  imposed, 
(e.g.,  the  organizational  unit  that  is  responsible  for  the  transaction  that 
logs  merchandise  arrival  should  not  have  the  Knowledge  of  the  current  invento¬ 
ry  level  or  the  details  of  the  merchandise  ordering  process.)  In  fact,  the 


above  case  can  be  generalized  further  by,  for  example,  adding  to  it  applica- 
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tions  that  read  from  the  reordering  records  and  the  merchandise-arrival 
records  to  generate  supplier  profile  records  in  the  database.  As  the  list 
goes  on,  it  is  believed  that  a  hierarchy  of  applications  exists  such  that  at 
each  level  of  the  hierarchy  the  transaction  only  reads  from  but  does  not  write 
into  data  produced  by  applications  of  an  earlier  level.  This  structure  pre¬ 
sents  a  definite  opportunity  for  concurrency  control  algorithms  to  explore. 

The  bases  for  our  technique  are  transaction  analysis  and  the  maintenance  of 
a  multi-version  database.  The  transaction  analysis  decomposes  the  database 
into  hierarchically  related  data  segments,  such  that  transations  that  write 
into  one  segment  will  only  read  from  the  same  data  segment  or  segments  of 
higher  levels.  The  technique  enables  read  accesses  to  higher-level  data  seg¬ 
ments  to  proceed  without  ever  having  to  wait;  it  requires  no  read  locks  or 
read  timestamps  be  set  for  such  accesses.  This  methodology  may  also  bear 
implications  on  database  and  transaction  design  activities. 

1.3  OVERVIEW  OF  RELEVANT  LITERATURE 


Concurrency  control  in  a  centralized  or  a  distributed  database  system  has 
been  an  active  research  area.  The  concept  of  database  consistency  has  been 
formally  analyzed  in  <Eswaran76,  Gray76>,  in  which  set-theoretic  notions  are 
used  to  formulate  the  concept.  A  consistent  schedule  of  concurrent  trans¬ 
actions  has  been  defined  to  be  one  which  is  equivalent  to  a  serialized  sched¬ 
ule.  Two-phase  locking  has  been  proposed  in  their  papers  as  a  mechanism  which 
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preserves  serializability .  This  notion  of  serializability  has  been  further 
explored  in  <Bernstein79>. 

Algorithms  for  database  concurrency  control  abound  in  the  literature.  The 
distributed  database  present  a  challenge  for  the  consistency  problem  which  has 
encouraged  development  of  many  new  algorithms.  <For  example,  Ellis77, 
Lamport78,  Rosenkrantz78 ,  Thomas79,  Bernstein80b. >  A  survey  and  comparison  of 
theories  and  algorithms  of  concurrency  control  can  be  found  in  <Bernstein80b, 
Badal80>.  Most  algorithms  are  considered  variations,  extensions  and/or  combi¬ 
nations  of  the  two  basic  techniques  for  concurrency  control  -  two-phase 
locking  and  time  stamp  ordering.  The  two-phase  locking  algorithm  ensures  con¬ 
sistency  by  imposing  a  partial  order  on  all  transactions  based  on  their  lock 
points.  (A  lock  point  of  a  transaction  is  the  point  in  time  when  the  locking 
phase  of  the  transaction  reaches  its  peak.)  The  timestamp  ordering  algorithm 
ensures  consistency  by  imposing  a  partial  order  on  all  transactions  based  on 
the  initiation  times  of  the  transations.  These  two  techniques  have  been  used 
as  the  basis  for  further  explorations. 

One  of  the  foci  of  recent  development  in  concurrency  control  algorithms 
centers  around  the  identification  of  techniques  that  increase  level  of  concur¬ 
rency  and/or  reduce  synchronization  overhead,  at  the  same  time  preserving  the 
correctness  of  the  algorithm.  One  dimension  of  such  effort  is  the 
multi-version  database.  It's  been  observed  that  keeping  multiple  versions  of 
database  elements  will  improve  concurrency  of  the  database  <PapaaimitriouS2> . 
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In  Papadimitriou ' s  paper,  it  is  shown  that  there  exists  an  infinite  hierarchy 
of  multi-version  serializability ,  and  proven  that  the  more  versions  a  DBMS 
keeps,  the  higher  is  the  level  of  concurrency  it  may  achieve. 

The  concept  of  a  timestamp-based  multi-version  database  system  was  first 
proposed  in  <Reed78>.  It  is  a  general  scheme  in  which  the  identifier  of  a 
data  element  consists  of  two  components:  the  name  of  the  data  and  the  version 
of  the  data.  In  Reed's  scheme,  retrieval  of  an  arbitrary  time  slice  of  the 
database  is  allowed. 

A  more  limited  multi-version  concept  was  developed  in  <Bayer80>.  In  his 
scheme,  one  previous  version  of  a  data  element,  which  has  been  saved  for 
recovery  purposes  when  the  data  element  is  going  through  changes  made  by 
uncommitted  transactions,  is  utilized  to  allow  read  accesses  to  proceed  with¬ 
out  having  to  wait  for  the  commitment  of  the  update  transaction.  In 
<Garcia-Molina82> ,  a  framework  of  strateties  for  processing  read-only  trans¬ 
actions  is  presented.  In  <Viemont82>,  an  interesting  method  for  concurrency 
control  is  devised  which  also  makes  use  of  this  one  extra  copy  of  data  ele¬ 
ments  to  synchronize  transactions  by  order  of  commit  time.  In  essance  his 
technique  is  one  which  blends  timestamp  ordering  and  two-phase  locking  in  one 
and  chooses  to  switch  to  one  or  the  other  at  the  most  opportune  time  sc  as  tc 
increase  level  of  concurrency.  In  <Stearns81,  Chan82>  the 
one-previous-version  method  was  extended  to  accommodate  multiple  previous  ver¬ 
sions  (but  does  not  allow  for  access  of  an  arbitrary  time  slice  cf  the 
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database  from  a  user.)  In  particular,  Chan's  method  is  based  on  two-phase 
locking  but  allows  the  read-only  transaction  to  receive  special  treatment  - 
they  do  not  have  to  set  read  locks. 

The  above  list  of  research  bears  resemblance  to  the  research  to  be  reported 
here.  However,  our  technique  is  one  which  is  timestamp  based  and  strives  to 
reduce  the  need  for  leaving  read  timestamps  for  not  just  read-only  trans¬ 
actions,  but  update  transactions  as  well. 

Another  dimension  of  efforts  of  reducing  synchronization  overhead  is  that 
of  conflict  analysis  <Bernstein80b> .  In  the  research  on  concurrency  control 
for  SDD-1,  conflict  analysis  was  proposed  which  exploits  a  priori  knowledge  of 
the  nature  of  the  transactions  to  be  run  in  the  system.  (To  some  extent,  some 
of  the  research  listed  in  the  previous  paragraphs  concerning  providing  special 
protocols  for  read-only  transactions  fall  into  this  category  too,  as  it 
exploits  the  knowledge,  albeit  a  limited  one,  of  the  nature  of  the  trans¬ 
actions,  namely,  whether  they  are  read-only  or  not.)  The  approach  reported  in 
the  present  paper  is  different  from  that  of  SDD-1  because  it  is  not  oriented 
towards  distributed  database  systems,  and,  because  of  the  special  structure  of 
applications  that  our  approach  exploits,  together  with  the  fact  that  multiple 
version  technique  is  employed,  the  protocols  are  much  less  restricted.  These 
new  protocols  are  therefore  are  more  practical  to  implement.  A  comparison 
between  the  approach  proposed  here  and  that  of  SDD-l  is  included  in  the  summa- 
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2.0  BASIC  CONCEPTS  OF  MULTI-VERSION  CONSISTENCY 

In  this  chapter,  we  present  the  basic  concept  of  multi-version  consistency 
in  graph-theoretic  terms.  This  material  is  mostly  taken  from 
<Papadimitriou82,  Bernstein82> ,  except  for  some  notational  differences.  We 
wish  to  establish  the  fact  that,  if  serializability  of  transactions  is  taken 
as  the  criterion  for  correctness  of  concurrency  control,  then  acyclicity  of  a 
transaction  dependency  graph,  constructed  as  a  result  of  a  multi-version 
scheduling  algorithm,  is  the  evidence  of  correctness  of  the  algorithm,  as  it 
preserves  serializability  of  transactions. 

We  present  the  following  definitions  and  theorem. 

Definition.  A  schedule  of  a  set  c.f  transactions  T,  denoted  as  S(T),  is  a 
sequence  of  steps,  each  of  which  is  denoted  as  a  tuple  of  the  form 
^transaction  id,  action,  version  of  a  data  granule>. 

The  action  can  be  read  (r)  or  write  (w) .  The  version  of  a  data  granule  is 
denoted  as  dv,  where  d  indicates  the  data  granule  and  v  indicates  the  version. 
If  the  action  is  write,  then  the  version  of  the  data  granule  included  in  the 
step  is  created  by  the  transaction.  If  the  action  is  read,  then  the  trans¬ 
action  reads  the  version  of  the  data  granule  indicated  in  the  tuple. 

An  example  of  a  schedule  is  <t1,w,d’>,  <t2,r,d’>,  <t2,w,dJ>,  <t3,r,dJ>. 
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Def  i  (li  ti 01) .  A  version  j  of  a  data  element  d  is  the  predecessor  of  a  ver¬ 
sion  k  of  d  if  <t1fw,dJ>  is  before  <t2,w,dk>  in  S(T)  where  t1(  t2  e  Tr  and 
there  exists  no  t  e  T  and  i  such  that  <t,w,d1>is  between  <t !  ,w,dJ>and  <t2; 
w,dk>  in  S (T) . 

Definition.  a  transaction  dependency  graph  of  a  schedule  SIT )  is  a 
digraph,  denoted  as  TG(S(T)),  where  the  nodes  are  the  transactions  in  T  and 
the  arcs,  representing  direct  dependenci es  between  transactions,  exist  accord¬ 
ing  to  the  following  rules: 
t2  -  1 1  e  A  iff 

(1)  <t1,w,dv>  and  <t2,r,dv>  are  in  S(T)  for  some  dv,  or 

(2)  <t,,r,dJ>  and  <t2,w,dk>  are  in  S(T)  for  some  dj,dk  where  dJ  is  the 
predecessor  of  dk. 

In  other  words,  the  transaction  dependency  graph  represents  a  relation  - 
(depends  on)  of  transactions  such  that  t2  -  t,  if  t2  reads  a  version  of  a  data 
granule  created  by  t,  or  if  t2  creates  a  version  of  a  data  granule  whose  pred¬ 
ecessor  has  been  read  by  t,. 

Definition.  Two  schedules  S,(T)  and  S 2 (T)  of  the  same  set  of  transaction 
set  T  is  said  to  be  equivalent  iff  TG(S1(T))  =  TG (S2 (T) ) . 
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Definition.  A  schedule  S(T)  is  serializable  if  there  exists  an  equivalent 
schedule  Ss(T)  where  all  transactions  in  Ss(T)  are  serialized,  (i.e.,  no  steps 
of  one  transaction  are  interleaved  with  steps  from  another  transaction.) 

In  <Bernstein82>  the  following  theorem  has  been  shown:  a  schedule  S(T)  is 
serializable  iff  TG(S(T))  is  acyclic. 
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3.0  HIERARCHICAL  DATABASE  DECOMPOSITION 

3.1  SOKE  GRAPHIC-THEORETIC  DEFINITIONS 

We  first  briefly  introduce  the  concept  of  a  digraph  called  a  transitive 
semi-tree.  This  concept  will  then  be  used  to  describe  the  desirable  database 
partition  to  which  our  concurrency  control  technique  can  be  applied. 
Informally,  a  semi-tree  is  a  digraph  such  that,  if  the  directions  of  the  arcs 
in  the  graph  are  ignored,  the  graph  appears  to  be  a  spanning  tree.  A  transi¬ 
tive  semi-tree  is  a  digraph  whose  transitive  reduction  is  a  semi-tree,  i.e., 
it  is  a  semi-tree  with  an  arbitrary  number  of  additional  transitively  induced 
arcs. 

Def i ni ti on .  A  semi-tree  is  a  digraph  such  that  there  exists  at  most  one 
undirected  path  between  any  pair  of  nodes  in  the  graph.  Every  arc  in  a 
semi-tree  is  called  a  critical  arc. 

Definition.  a  digraph  g  is  a  transitive  semi-tree  iff  its  transitive 
reduction  is  a  semi-tree. 

An  example  of  a  transitive  semi-tree  is  shown  in  Figure  5.  It  can  be  seen 
that  the  definition  of  a  transitive  semi-tree  is  more  relaxed  than  a  directed 
tree,  but  is  more  restricted  than  an  acyclic  directed  graph.  The  following 
two  properties  are  associated  with  the  transitive  semi-tree. 


Property .  A  path  in  a  transitive  seir.i-tree  is  a  critical  path  iff  it  is 
composed  of  critical  arcs  alone. 

Property .  There  exists  at  most  one  critical  path  between  any  pair  of  nodes 
in  a  transitive-semi  tree. 

3.2  DATABASE  PARTITION 

We  will  use  the  concept  of  a  data  hierarchy  graph  (DHG),  constructed  by 
means  of  transaction  analysis,  to  characterize  the  relationship  between  a 
database  partition  scheme  and  database  transactions.  As  will  be  shown  later, 
the  topology  of  the  DHG  of  a  particular  database  partition  scheme  will  indi¬ 
cate  whether  or  not  our  concurrency  control  technique  can  be  applied  to  that 
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partition  scheme.  Informally,  let  a  database  be  partitioned  into  data  seg¬ 
ments.  A  DHG  is  a  digraph  with  nodes  corresponding  to  the  data  segments  and 
arcs  constructed  in  such  a  way  that  there  is  an  arc  from  a  data  segment  D,  tc 
another  data  segment  Dj  if  and  only  if  one  can  find  a  potential  transaction  ir. 
the  database  system  that  updates  data  elements  in  D,  and  accesses  (i.e.,  reads 
or  writes)  data  elements  in  Dj.  In  other  words,  D,  -  Dj,  i  #  j,  indicates 
that  there  exist  transactions  in  the  system  that  would  link  updates  of  data 
elements  in  D,  to  the  content  of  data  elements  in  Dj. 

Definit  ion.  Let  Tu  be  a  set  of  update,  transactions  to  be  performed  or.  a 
database  D.  Let  P  be  a  partition  of  D  into  data  segments  D  1  ,D2» • • • ,DP.  A 
data  hierarchy  graph  of  P  w.r.t.  tu  is  a  digraph  denoted  as  DHG  (F ,  Tu)  with 
nodes  corresponding  to  the  data  segments  of  P  and  a  set  of  directed  arcs  join¬ 
ing  these  nodes  such  that,  for  i  #  j ,  D,  -*  D,  iff  there  exist  t  e  Tu  s.t. 
w(t)  f)  Dj  *  empty  and  a(t)  0  Dj  *  empty,  where  t  is  a  transaction,  w(t), 
r(t)  and  a(t)  the  write  set,  the  read  set  and  the  access  set  of  transaction  t. 
(The  access  set  a(t)  is  the  union  of  r(t)  and  w(t).) 

The  kind  of  database  partition  to  which  our  concurrency  control  technique 
can  be  applied  is  one  such  that  its  data  hierarchy  graph  satisfies  the 
topological  requirement  that  it  be  a  transitive  semi-tree. 

Definition.  A  partition  P  of  a  database  D  is  TST-hi erarchi cal  with  respect 
to  Tu  iff  DHG ( P , T u )  is  a  transitive  semi-tree. 
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Property.  Let  p  be  a  TST-hierarchical  partition  w.r.t.  Tu.  Then  t  f  Tu 
writes  in  one  and  only  data  segment  in  P. 

Proof.  Suppose  t  writes  in  two  distict  data  segments  D,  and  DJ#  then 
according  to  our  rule  of  construction  of  DHG(P,TU),  D,  -  DJf  Dj  -  D  ,  t 
DHG(P,TU),  therefore  DHG(P,TU)  is  no  longer  a  transitive  semi-tree,  which 
means  that  P  is  not  TST-hierarchical  w.r.t  Tu,  and  contradicts  the  assumption. 

Based  on  the  above  property,  a  TST-hierarchical  database  partition  P  also 
defines  a  transaction  classification  as  follows. 

Definition.  A  transaction  cl assi f i cati on  of  a  database  partition  which  is 
TST-hierarchical  w.r.t.  Tu,  is  a  partition  of  the  set  Tu  of  all  update  trans¬ 
actions  into  transaction  classes  T-,T;,  . ..,  Tn,  such  that  a  transaction  t  e 
T,  iff  t  writes  in  data  segment  D,. 

Therefore  a  transaction  classification  partitions  the  set  of  update  trans¬ 
actions  into  classes,  each  of  which  corresponds  to  a  data  segment  in  the  data 
partition.  We  define  the  image  of  the  data  hierarchy  graph  for  the  trans¬ 
action  classification  as  follows: 

Definition.  a  transacti on  hierarchy  graph  thg(p,tu)  of  a  database  parti¬ 
tion  P,  TST-hierarchical  w.r.t.  Tu,  is  a  digraph  where  the  nodes  are  trans¬ 
action  classes  Tj's  based  on  transaction  classification  defined  above,  and 
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arcs  connecting  these  nodes  such  that  T,  -  Tj  e  A  iff  D ,  —  D  exists  in 
corresponding  DHG(P,TU). 

Given  definitions  of  DHG  and  THG  above,  we  shall  denote  a  chHaCClI 
i  to  j  in  TKG  or  DHG  as  CP , 1 .  Therefore,  T,  -  Tk  -•  ...  -  1 ,  =  CF.J 
every  arc  is  a  critical  arc.  In  addition,  we  give  the  following  defintior. 

Definition.  We  define  higher  then  (denoted  as  t>)  as  a  partial  orde 
of  nodes  in  a  THG  or  a  DHG.  Specifically,  we  say  that  T:  higher  than  T,  ( 
,  T,)  iff  CP  , J  exists  in  the  graph. 
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4.0  SYNCHRONIZING  UPDATE  TRANSACTIONS 

Given  a  TST-hierarchicai  database  partition,  the  key  to  our  concurrency 
control  technique  is  the  recognition  that,  if  a  transaction  t  belongs  to  a 
class  T,  that  writes  data  segment  D,  and  reads  data  segment  DJf  and  Dj  is 
higher  than  D,  in  the  Data  Hierarchy  Graph,  then  this  transaction  would  appear 
to  be  a  read  only  transaction  so  far  as  Dj  is  concerned.  Therefore  when  a 
request  to  read  a  data  element  d  in  Dj  is  issued  by  t,  there  may  exist  a  prop¬ 
er  committed  version  of  b  that  is  6CL^e.  to  be  giver,  to  t  without  the  need  of 
leaving  a  read  timestamp  with  d.  However,  the  way  this  proper  version  is  com¬ 
puted  must  be  such  that  the  overall  serializability  is  enforced.  In  other 
words,  the  introduction  of  transaction  dependency  of  t  on  t ' ,  where  t'  is  the 
transaction  in  class  T_,  which  created  the  version  of  d  that  t  is  allowed  tc 
read,  must  never  induce  cycles  ir.  the  transaction  dependency  graph  as  defined 
in  Section  smulti . .  To  this  end,  a  function  called  the  activity  link  functicr. 
is  devised  to  compute  versions  that  cross-class  read  accesses  may  be  granted, 
and  a  theorem  which  testifies  to  the  correctness  of  this  computation  is  pre¬ 
sented.  Based  on  this  theorem,  a  concurrency  control  algorithm  is  also 
presented. 

Notations  . 

(1)  I(t)  =  the  initiation  time  of  a  transaction  t. 

(2)  C (t)  =  the  commit  time  of  a  transaction  t. 

(3)  TS(dv)  =  the  mititation  time  of  the  transaction  that  creates  the  ver¬ 
sion  v  of  a  data  granule  d,  i.e.,  the  write  timestamp  of  dv.  (A  data 


?  1 
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granule  is  the  smallest  unit  that  concerns  the  concurrency  control  com¬ 
ponent  of  the  database  system,  and  is  the  smallest  unit  of  accesses  so 
far  as  concurrency  control  is  concerned.) 

4.1  THE  ACTIVITY  LINK  FUNCTION 


The  following  definitions  and  properties  apply  to  a  database  with  a  parti¬ 
tion  P  which  is  TST-hierarchical  w.r.t.  Tu  and  has  a  corresponding  transaction 
classification. 

Definition.  A  function  I,ola  defined  for  a  transaction  class  T,  is  a  func¬ 
tion  which  maps  a  time  m  to  another  time  m’  such  that  m'  =  I,ola(m),  where  n' 
is  the  initiation  time  of  the  oldest  active  (i.e.,  uncommited  and  un-aborted) 
transaction  in  the  transaction  class  T,  at  time  m.  Formally, 


m  if  there  exists  no  t  e  T ,  active  at  time  m, 

I ,  ° 1  d(m)  =■ 

Min  (I(t))  otherwise,  where  t  c  T,,  I(t)  <  m  and  C(t)  >  m. 


Definition.  Let  the  activity  link  function  A,J  be  a  function  defined  for  a 
pair  of  transaction  classes  Tj  and  Tjf  where  T,  and  T:  are  transaction  classes 
such  that  Tj  t>  Ti.  A,j  recursively  maps  a  time  m  to  another  time  as  follows. 


'  I j  ° 1 d(m)  if  Ti  -  Tj  =  CP, j 

A i J (m)  =  \ 

Akj(Aik(m))  otherwise,  where 
T,  -  Tk  -...-  T j  =  C? , J . 
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.  j/  »  T  Old,  ..  _  j 

A.  (m)  *  I.  (I,  (m) )  ,  if  CP. 


Figure  6.  Graphical  represenation  of  the  A  function 


That  is,  the  function  A  maps  a  time  m  for  a  transaction  from  class  T,  to 
the  initiation  time,  A,J(m),  of  successively  (i.e.,  along  the  critical  path  cf 
THG)  the  oldest  active  transaction  in  the  class  T } .  For  example,  if  the  crit¬ 
ical  path  between  T4  and  T:  is  T,  -  Tk  -  TJf  then  A,J(m)  =  I i ° 1 d(l k° ’ °(r ■.))  . 
This  is  exemplified  in  Figure  6. 
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4.2  CONCURRENCY  CONTROL  ALGORITHM  FOR  UPDATE  TRANSACTIONS 

Based  on  the  definitions  given  above,  we  describe  in  this  subsection  the 
concurrency  control  algorithm  for  update  transactions  under  the  hierarchical 
decomposition  approach,  and  prove  its  correctness.  For  the  purpose  of  concur¬ 
rency  control,  we  assume  that  every  data  segment  is  controlled  by  a  segment 
contnollen.  which  supervises  accesses  to  data  granules  within  that  segment. 

Concurrency  control  algorithm  for  update  transactions: 

For  every  database  access  request  from  an  update  transaction  t  e  T,  for  a 
data  granule  d  e  DJf  the  following  protocol  is  observed: 

Protocol  A 

If  i  *  j,  then  the  segment  controller  of  Dj  provides  the  version  d°  of  d  such 
that 

TS(d°)  =  Max(TS(dv))  for  all  v  such  that 
TS(dv)  <  A,J(I(t)). 

(Note  that  no  trace  of  this  access  needs  to  be  registered  in  any  form  for  the 
purpose  of  concurrency  control  by  the  segment  controller.) 

Protocol  B 

If  i  =  j,  then  use  the  basic  timestamp  on.denJ.ng  pnotocoi  <Bernstein80>  or  the 
muJUj-oension  timestamp  ondening  protocol  <Reed78>. 
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4.3  PROOF  OF  CORRECTNESS 

To  show  that  the  above  algorithm  is  correct,  one  must  show  that 
serializability  is  enforced.  In  order  to  do  this,  we  define  a  relation  => 

between  a  pair  of  transactions  and  show  that  the  above  algorithm  allows  a 
transaction  t,  to  directly  depend  on  a  transaction  t2  only  if  t ,  =>  t2.(D- 

rect  dependency  is  defined  in  Section  fimulti.)  We  then  show  that  properties 
of  the  relation  =>  lead  to  Theorem  1,  which  concludes  that  the  above  algorithm 
preserves  serializability. 

Definition.  A  relation  topologically  follows  (denoted  as  =>)  is  defined 
for  a  pair  of  transactions  t1(  t2,  where  tt  t  T,,  t2  e  T i ,  T,  and  T:  are  con¬ 
nected  by  a  critical  path  in  THG,  i  and  j  not  necessarily  distinct.  We  say 
that  t,  topologi cal  1 y  follows  t2  (or  t,  =>  t2)  iff 

(1)  if  Ti  =  T j  then  I(t,)  >  I(t2). 

(2)  If  T,  t>  Tj  then  I(t,)  >  Aj'(I(t2)). 

(3)  If  Tj  +>  T,  then  I(t2)  <  A^dtt,)). 

Intuitively,  =>  is  a  relation  between  transactions  based  on  both  the  timing 
of  the  transactions  and  the  hierarchical  levels  in  the  THG  of  the  transaction 
classes  that  the  transactions  belong  to.  To  be  more  specific,  ’ti  =>  t2’ 
always  means  that  t,  is  'later'  than  t2.  However,  this  ’later’  is  not  only 
based  on  the  initiation  times  of  the  two  transactions  involved,  but  also  on 
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the  relative  levels  of  the  transaction  classes  to  which  t,  ana  t2  belong: 


Given  a  fixed  t2,  the  lower  the  level  of  t,,  the  later  t,'s  initiation  tine 
has  to  be  in  order  for  t1  =>  t2  tj  hold.  Clearly,  =>  is  defined  only  between 
transactions  that  belong  to  classes  that  are  on  a  critical  path  ir.  TK3 , 
because  otherwise  the  A  function  is  undefined.  This  relation  is  exemplified 
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in  Figure  7.  Two  interesting  properties  concerning  the  relation  -->  are  pre¬ 
sented  below: 

Property  1.1.  The  relation  =>  is  anti-symmetric.  (This  directly  follows 

from  the  definition  of  the  relation.) 

Property  1.2  (The  property  of  transitivity/ .  The  relation  =>  is 
Cri ti Cal -pat h  transitive,  i.e.,  if  there  exists  t,  e  T,,  t2  e  Tk,  t3  f  Tj, 
such  that  t1  =>  t2»  1 2  =>  t3  and  T,,  Tk  and  Tj  are  on  a  critical  path  in  THG , 
then  t ,  =>  1 3. 

Proof .  (See  Appendix) 

We  now  define  the  following  synchronization  rule  and  show  that  our  concur¬ 
rency  control  algorithm  enforces  this  rule. 

Definition,  we  say  that  the  partition  synchroni zati on  rule  (abbreviated  as 
PSR)  is  enforced  in  a  schedule  S(TU)  if,  for  any  tw  t2  £  TJ,  t,  -  t2  t 
TG (S (Tu) )  implies  that  t ,  =>  t2. 

A  concurrency  control  algorithm  enforces  the  partition  synchronization  rule 
if  it  allows  direct  dependencies  to  occur  between  transactions  t,  and  t2  only 
if  t1  =>  t2*  This  is  translated  into  the  following  three  cases: 
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(1)  If  t,  and  t2  are  in  the  same  transaction  class,  the  algorithm  must 
allow  t,  to  read  a  version  v  of  a  data  granule  d  created  by  t2,  or  to  create  a 
new  version  of  a  data  granule  d  whose  latest  version  dv  was  created  by  t2, 
only  if  t2  has  an  inititation  time  that  is  less  than  that  of  t,.  (i.e.,  only 
if  TS(dv)  <  l(t,)  .) 

Protocol  B  of  our  algorithm  satisfies  this  requirement. 

(2)  If  t ,  belongs  to  a  class  I,  cf  a  lower  level  while  t2  belongs  to  a 
class  Tj  of  a  higher  level,  then  the  algorithm  must  allow  t-,  to  read  dv  cre¬ 
ated  by  t2  only  if  t2  has  an  initiation  time  less  than  A^Utt,)).  (i.e.,  only 
if  TS(dv)  <  A,J(I(t1)).) 

Protocol  A  of  our  algorithm  satisfies  this  requirement. 

(3)  If  t,  belongs  to  a  class  Ij  of  a  higher  level  while  t2  belongs  to  a 
class  Tj  of  a  lower  level,  then  the  algorithm  must  allow  t-,  to  create,  at  time 
m,  a  new  version  of  a  data  granule  whose  predecessor  dv  has  been  read  by  t2, 
only  if  t,  has  an  initiation  time  greater  than  or  equal  to  AJ1(I(t2)). 

This,  however,  is  always  true  because,  by  the  very  fact  that  t ,  is  active 
at  time  m  and  I(t2)  <  m,  and  that  Aj’(I(t2))  yields  a  time  value  which  is  def¬ 
initely  smaller  than  the  initiation  time  of  the  oldest  active  transaction  ir 
class  Ti  at  time  m,  A,’(I(t2))  must  be  less  than  I(t,). 
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Therefore  we  conclude  that  our  algorithm  enforces  PSR.  What  is  left  tc  do 
in  proving  the  correctness  of  our  algorithm  is  tc  show  that  a  schedule  that 
enforces  PSR  is  also  correct.  The  following  theorem  therefore  completes  our 

proof. 

Theorem  1.  Let  TG(S(TU))  be  a  transaction  dependency  graph  of  a  set  c: 
udpate  transations  Tu  run  on  a  database  with  a  TST-hierarchical  partition  F, 
and  the  schedule  S  observes  the  partition  synchronization  rule  with  respect  t: 
the  transaction  classification  corresponding  to  P,  then  TG(S (Tu) )  has  n: 
cycles . 


Proof .  (See  Appendix) 
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5.0  SYNCHRONIZING  READ-ONLY  TRANSACTIONS 

What  has  been  discussed  is  the  algorithm  for  controlling  concurrent  update 
transactions.  Now  we  turn  to  the  read-only  transactions. 

For  a  read-only  transaction  t  that  reads  from  data  segments  that  lie  on  one 
critical  path  CP , J  of  the  DHG,  the  protocol  that  should  be  observe  is  the  same 
as  that  observed  by  the  update  transactions  in  a  class  immediately  below  the 
lowest  class  of  the  critical  path  CP , }  in  THG,  namely,  a  class  right  below 
class  T,.  (If  there  exists  no  class  below  T,  in  ThG,  then  a  fictitious  class 
can  be  created  to  'host'  this  read-only  transaction.)  Therefore  read-only 
transactions  will  have  to  obey  protocol  A  alone  and  will  not  cause  any  read 
timestamp  or  read  lock  to  be  generated.  This  is  graphically  presented  by 
transaction  t,  in  Figure  8. 

What  we  are  concerned  with  here  are  those  read-only  transactions  that  read 
from  any  combination  of  data  segments  that  do  not  lie  on  a  critical  path  in 
DHG,  as  illustrated  by  transaction  tj  in  Figure  8.  To  handle  these  trans¬ 
actions,  we  first  introduce  the  expended  ac.tA.vity  Zink  function  in  the  follow¬ 
ing  subsection. 

5.1  THE  EXTENDED  ACTIVITY  LINK  FUNCTION 

I..  the  previous  section  we  have  introduced  the  activity  link  function  which 
centers  around  the  linkage  between  transactions  in  classes  that  are  on  a  crit- 
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• .  t  reads  from  class 

(t  is  a  read-only  transaction) 


Figure  8.  Read-only  transactions  that  read  from  one  critical  path. 


ical  path  in  the  transaction  hierarchy  graph.  The  extended  function,  on  the 
other  \nd,  specifies  how  transactions  in  a  transaction  class  are  linked  to 
transactions  m  another  transaction  class  when  there  is  not  necessarily  any 
critical  path  that  connects  the  two.  This  function  is  used  to  provide  a  way 
of  computing  a  coruiUznt  dcutaba&e.  \tatZ  that  can  be  accessed  by  a  read  only 
transaction  that  reads  fr-m  any  comb:  ation  of  data  segments  in  the  database. 

We  will  first  introduce  the  functions  C,  ate  and  Bj1  that  can  be  considered 
conceptually  the  of  functions  I,ola  and  A,J.  Then  two  proper ^es. 
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the  relationship  between  the  functions  A,J  and  Bj ’  are  derived.  The  extended 
activity  link  function  E,J  is  then  defined  in  terms  of  functions  A  and  B,  and 
its  usefulness  is  indicated  in  a  lemma  that  follows.  The  existence  and  deri¬ 
vation  of  a  consistent  database  state  is  given  in  theorem  2,  which  makes  use 
of  the  extended  activity  link  function. 


Definiti on . 
er  m'  where  T, 


Let  C,  late:  ir.  -  m’  be  a  function  which  maps  time  m  to  ancth- 
is  a  transaction  class  and  C,late(m)  is  determined  as  fellows. 


C 


late 


(m) 


m  if  there  exists  no  t  e  T,  active  at  time  m. 

Max  (C (t ) )  otherwise,  where  t  t  T,.  I(t)  <  m  and  C(t)  >  n. 


That  is,  C,late(m)  is  the  lcute.it  commit  time  of  all  transactions  in  class  7 
that  started  before  or  at  time  m.  However,  to  make  C,late(m)  computable,  all 
such  transactions  must  have  committed  at  the  time  of  computation  of  C.iate(iT,). 
We  give  the  following  definition  concerning  the  computability  of  C,late(rr.). 


Definition.  Cj,a,e(m)  is  computable  at  time  m°  iff  there  exists  nc  trans¬ 
action  started  before  cr  at  time  m  that  is  still  active  at  time  m°. 


Now 
tion  A. 


we  introduce  a  function  which  is  conceptually  the  inverse  of  the  fur.c- 
While  the  A  function  maps  a  time  in  a  lower  level  class  to  the  initi¬ 


ation  time  of  some  transaction  in  a  higher  level  class,  the  B  function  maps  a 
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time  in  a  higher  level  class  to  the  commit  time  of  some  transaction  in  a  lower 
level  class: 

Definition.  The  Backward  activity  link  function,  defined  for  a  pair  of 
transaction  classes  T,  and  T:,  where  T_,  f  >  T , ,  denoted  as  Bj  '  (it,)  ,is  a  func¬ 
tion  which  maps  a  time  value  m  to  another  such  that 

[  Cj  1ate(m)  if  T,  -*  T j  =  CP  , J 

B/(m)  =  { 

Bk'(B,k(m))  otherwise, 

*■  where  T ,  -  . . .  -  Tk  -  T }  =  CP  ,  . 

The  following  two  properties  bind  the  functions  A  and  B  together  and 
formally  describe  how  they  are  the  inverse  of  each  other. 

Property  2.1.  AjJ(Bj'(uO)  £  m,  where  T,  -  ...  -  T;  =  CP , 1  in  the  trans¬ 
action  hierarchy  graph. 

Proof .  (See  Appendix) 

Property  2.2.  For  evey  positive  e,  A,J(E,’(m)  -  e)  <  m,  where  T,  -  ...  - 
Tj  =  CP  i j  in  the  transaction  hierarchy  graph. 

Proof .  (See  Appendix) 

Definition.  An  undirected  critical  path,  denoted  as  UCP,J,  is  an  ordered 
set  of  distinct  indices  of  transaction  classes  in  THG  such  that  UCP ,  ■  =  <i, 
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i 


ll,  i2,  ....  in,  j>  where  for  any  two  indices  h,  k  adjacent  in  the  set,  either 
Th  -  Tk  or  Tk  -  Th  is  a  crtical  arc  in  THG . 


It  is  obvious  that  for  a  TST-hierarchical  partition  there  exists  one  and 
only  one  UCP  in  THG  between  any  pair  of  transaction  classes.  While  the  activ¬ 
ity  link  function  A  is  defined  for  any  pair  of  transaction  classes  that  lie  or, 
a  critical  path,  the  extended  activity  link  function,  using  the  concept  of 
UCP,  is  defined  for  any  pair  of  transaction  classes. 

Definition.  The  extended  activity  link  function  defined  for  a  pair  cf 
transaction  classes  T,  and  TJf  denoted  as  is  a  function  which  maps  a 
time  value  m  to  another  such  that 


E i J (m) 


m  if  i  =  j , 

C,late(m)  if  i  *  j  and  Tj  -  T*  is  a  critical  arc  in  THG, 
Ij0lcl(m)  if  i  *  j  and  Ts  -  Tj  is  a  critical  arc  in  THG, 
Ekj(E1k(m))  otherwise,  where  <i,k,...,j>  =  UCP , J , 


The  following  lemma  illustrates  the  usefulness  of  the  extended  activity 
link  function. 

Lemma  2 .1 .  Let  Tk,  Tj  and  Tj  be  transaction  classes  in  a  THG  of  a 
TST-hierarchical  database  partition,  and  T,  and  Tj  are  on  one  critical  path. 
Then  for  any  time  value  m  and  t,  £  T,,  t2  £  TJf  if  I(t,)  <  Ek'(m)  and  I(t-)- 
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EkJ(m)  then  there  exists  no  t,  -  t2  in  the  transaction  dependency  graph 
TG (S (Tu) )  where  the  schedule  S  enforces  the  PSR's. 

Proof  .  (See  Appendix) 

Intuitively,  the  E  function  provides  a  way  cf  computing  a  Limt  woJLl  for  all 
transaction  classes  in  the  database  system  across  which  no  direct  dependency 
from  the  'older  side'  of  the  wall  to  the  'newer  side’  of  the  wall  can  occur. 
A  time  wall  TW(m,s)  is  the  set  of  all  times  Es'(m)  where  m  is  a  time,  Ds  is  a 
chosen  data  segment,  and  D,  is  any  data  segment.  This  concept  is  graphically 
presented  in  Figure  9.  The  significance  of  this  concept  is  that  if  a 
read-only  transaction  reads  the  latest  versions  of  data  granules  of  data  seg¬ 
ment  D,  which  are  right  before  the  time  indicated  by  the  time  wall  component 
Es'(m)  of  certain  time  wall  TW(m,s),  then  it  is  accessing  a  consistent  data¬ 
base  state  and  will  in  no  way  induce  cycles  into  the  transaction  dependency 
graph.  This  discussion  is  formally  presented  in  the  following  theorem. 

Theorem  2.  If  the  schedule  S  enforces  the  PSR  on  Tu,  and  for  every  d  e  D, 
that  a  read-only  transaction  tR  reads,  S  allows  it  to  read  the  version  d°  such 
that 

TS(d°)  =  Max  (TS  (d v) )  where  TS(dv)  <  EsMm), 
for  some  time  m  and  some  transaction  class  index  s,  then  TG(S(TU  U  tR))  has  no 
cycle. 


Proof .  (See  Appendix) 
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A  time  wall  TW(s,m)  is  such  that  no  direct 
dependencies  occur  between  a  transaction  on 
the  left  side  of  the  dotted  line  (i.e.,  l(t) 
<  E  (m)  )  and  that  on  the  right  side  of  the 
dotted  line  (i.e.,  I(t)> 

Figure  9.  The  E  function  used  as  a  ’time  wall.' 


In  other  words,  if  a  read-only  transaction  reads  the  latest  versions  cf 
data  granules  of  data  segment  D i  which  are  right  before  the  time  indicated  by 
the  time  wall  component  E5'(m)  of  certain  time  wall  TW(m,s),  then  it  is 
accessing  a  consistent  database  state  and  will  not  induce  cycles  into  the 
transaction  dependency  graph. 

5.2  CONCURRENCY  CONTROL.  PROTOCOL  FOR  READ-ONLY  TRANSACTION’S 

Making  use  of  Theorem  2,  a  read-only  transaction  t  that  reads  from,  data 


segments  that  do  not  lie  on  one  critical  path  in  DHG  should  be  given  versions 
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that  are  the  latest  before  certain  time  wall.  However,  to  compute  the  time 
wall  the  system  has  to  determine  the  starting  transaction  class  Ts  and  a 
starting  time  value  m.  While  the  choice  can  be  arbitrary,  it  is  theoretically 
desirable  that  the  following  criteria  are  met: 

(1)  Es’(m)  (for  all  T,  in  the  THG)  is  computable  at  I(t),  the  initiation 
time  of  the  read-only  transaction. 

(2)  There  exists  no  m'  >  m  such  that  Es'(m’)  is  computable  at  I ( t )  for  all 
T,  in  the  THG. 

The  first  criterion  stipulates  that  m  should  be  mall  enough  so  that  all  Es’ 
(m)  is  computable  at  I(t),  therefore  t  potentially  does  not  have  to  wait  until 
a  later  time  to  access  from  certain  segment.  (If  some  EsJ(m)  is  not  computable 
at  I(t),  t  would  have  to  wait  till  a  later  time  when  it  is  computable  before 
accessing  data  from  data  segment  Dj.)  The  second  criterion  strives  to  achieve 
reading  of  the  newest  poAAlble.  database  state. 

A  compromise  is  struck  here  in  devising  our  protocol  for  read-only  trans¬ 
actions.  First,  to  save  computation  time,  a  new  time  wall  is  computed  by  the 
system  at  certain  intervals  and  the  new  time  wall  is  'released'  to  all 
read-only  transactions  that  start  before  the  next  vZA-ilon  of  the  time  wall  is 
released  by  the  system.  (That  is,  there  is  no  need  to  compute  a  time  wall  for 
every  read-only  transaction.)  In  computing  the  next  version  of  the  time  wall, 
the  system  can  choose  arbitrarily  a  starting  class  Ts  which  is  of  one  of  the 
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lowest  levels  and  choose  m  to  be  the  current  time.  If  it  encounters  any  C, 
late  function  that  it  cannot  compute,  it  waits  until  it  becomes  computable. 
Eventually  enough  time  will  elapse  such  that  Es'(m)  becomes  computable  for  all 
T , ' s .  Then  a  newly  constructed  time  wall  is  released. 

Let  the  release  time  of  a  time  wall  TW(m,s)  be  denoted  as  RT  (TW  ( ir. ,  s) )  .  Now 
we  provide  the  formal  definition  of  the  read-only  transaction  synchronization 
protocol . 

Concurrency  Control  Algorithm  for  Read-Only  Transacti on 

For  every  database  read  request  from  a  read-only  transaction  t  for  a  data 
granule  d,  the  following  protocol  is  observed: 

Protocol  C 

Let  d  e  D,.  The  segment  controller  of  D,  provides  the  version  d°  of  d  such 
that 

TS(d°)  =  Max(TS(dv))  for  all  v  such  that 
TS(dv)  <  Es  1  (m) 

where  RT(TW(m,s))  =  Max (RT (TW) )  for  all  TW  such  that  RT (TW)  <  I(t). 
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6.0  SUMMARY 


A  new  technique  of  concurrency  control  for  database  management  systems  has 
been  proposed.  The  technique  makes  use  of  a  hierarchical  database  decompos¬ 
ition,  a  procedure  which  decomposes  the  entire  database  into  data  segments 
based  on  the  access  pattern  of  the  update  transactions  to  be  run  in  the 
system.  A  corresponding  classification  of  the  update  transactions  is  derived 
whei e  each  transaction  class  is  'rooted'  in  one  of  the  data  segments. 

The  technique  requires  a  timestamp  ordering  protocol  be  observed  for 
acesses  within  an  update  transaction's  own  root  segment,  but  enables  read 
accesses  to  other  data  segments  to  proceed  without  ever  having  to  wait  or  to 
leave  any  trace  of  these  accesses,  thereby  reducing  the  overhead  of  concurren¬ 
cy  control.  An  algorithm  for  handling  ad-hoc  read-only  transactions  in  this 
environment  is  also  devised,  which  does  not  require  read-only  transactions  tc 
wait  or  set  any  read  timestamp.  The  proof  of  correctness  of  these  algorithms 
in  terms  of  their  preservation  of  serializability  is  provided  through  a  set  of 
eight  properties,  three  lemmas  and  two  theorems.  A  comparison  of  the  SDD-i 
approach,  the  multi-version  two-phase  locking  approach  (MV2PL.)  and  the  Hierar¬ 
chical  Database  Decomposition  (HDD)  approach  proposed  here  is  given  in 
Figure  10. 

7.0  FUTURE  RESEARCH  DIRECTIONS 
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Figure  10.  A  comparison  of  the  HDD,  SDD-1  and  KV2PL  approaches. 


7.1.1  DYNAMIC  RESTRUCTURING  OF  DATABASE  DECOMPOSITION 

We  will  develop  a  scheme  for  handling  ad  hoc  transactions  that  introduce 
access  patterns  which  cause  an  original  partition  to  become  illegal.  This 
aspect  of  the  technique  is  important  for  two  reasons.  First,  any  database 
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partition  and  transaction  classification  based  on  an  a  priori  analysis  of  all 
possible  transactions  to  be  run  in  the  system  is  not  likely  to  be  fault-proof. 
Second,  some  transactions  that  are  not  frequently  run  but  demand  an  access 
pattern  which  causes  a  finer  partition  to  become  illegal  may  be  left  out  of 
the  pre-analysis  intentionally,  so  that  for  the  majority  of  the  time  the  sys¬ 
tem  can  operate  under  a  finer  partition  while  a  special  handling  is  adopted  to 
take  care  of  this  type  of  transactions  when  they  enter  the  system.  We  will 
try  to  achieve  a  scheme  which  can  dynair^caZZy  restructure  the  database  parti¬ 
tion.  That  is,  it  should  be  a  scheme  which  does  not  require  a  quiescence  of 
the  database  activity  in  order  to  perform  the  restructuring  or  the 
restoration. 

7.2  HIERARCHICAL  DATABASE  DECOMPOSITION  METHODOLOGY 


7.2.1  HANDLING  ACYCLIC  DECOMPOSITION 


The  present  technique  centers  around  the  transitive  semi-tree  form  of 
hierarchical  database  decomposition.  The  transitive  semi-tree,  while  much 
more  relaxed  than  a  directed  tree,  may  still  be  too  restricted  for  the  tech¬ 
nique  to  be  useful  in  some  applications.  Based  on  the  theories  developed  for 
the  current  technique,  we  propose  to  find  an  algorithm  that  will  transform  a 
database  partition  whose  data  hierarchy  graph  is  of  the  form  of  an  acyclic 
graph  to  a  legal  partition,  while  preserving  the  granularity  of  the  original 
partition  as  much  as  possible. 
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7.2.2  DATABASE  DECOMPOSITION  METHODOLOGY  VIA  DATA  ANALYSIS 


We  propose  to  study  in  detail  graph-theoretic  methodologies  that  can  be 
used  to  cluster  data  elements  of  a  database  to  arrive  at  a  legal  or  an  acyclic 
decomposition  of  the  database. 

7.3  IMPLEMENTATION  OF  THE  CONCURRENCY  CONTROL  PROTOCOLS 


Studies  will  be  made  of  the  complexity  of  implementation  of  protocols 
developed  in  the  current  and  future  research.  Algorithms  for  (l)  computing 
the  version  numbers  to  be  accessed  by  a  transaction,  (2)  maintaining 
multiple-versions  of  the  database,  and  (3)  garbage  collection  will  be 
addressed . 

7.4  THE  EFFICACY  OF  THE  HDD  APPROACH 


In  order  to  substantiate  the  underlying  assumptions  used  in  the  HDD 
approach  of  the  hierarchical  organization  of  databases,  we  propose  to  under¬ 
take  a  case  study  of  operations  of  some  real  organizations.  These  cases  will 
also  be  used  to  validate  the  claimed  advantages  of  the  HDD  approach  over  con¬ 
ventional  approachs  to  concurrency  control. 

7.5  DATABASE  COMPUTER  APPLICATIONS 
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One  of  the  motivations  for  the  current  research  is  to  find  a  way  to  opti¬ 
mize  the  concurrency  control  activities  inside  of  a  multi-processor  based 
database  computer  that  employs  a  hierarchical  decomposition  of  the  DBMS 
functionalities.  The  potential  of  the  current  technique  in  reducing 
inter-level  synchronization  communications  will  be  explored  and  algorithms  for 
concurrency  control  in  such  an  environment  will  be  proposed. 
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APPENDIX 

(I)  PROOF  OF  PROPERTY  1.2 

Property  1.2  (The  property  of  transitivity/. 

Proof .  We  consider  the  following  5  groups  of  cases: 

(1)  T,  =  Tk  =  Tj.  By  definition  of  =>  we  have  I(t,)  >  I(t2)  >  I(t3).  Tnere- 
fore  t,  =>  t3. 

(2)  T,  =  Tk  #  Tj.  Two  cases  are  considered: 

(2.1)  T,  t>  Tj.  Then  t2  =>  t3  implies  I(t2)2  Aj’(I(t3))  t,  =>  t2  implies 
I ( 1 1 )  >  I(t2).  Therefore  I(t,)  >  Aj  1  (I (t3)) .  Therefore  t,  =>  t3. 

(2.2)  Tj  +  >  T,.  Then  tj  =>  t2  implies  I(t,)  >  I(t2).  By  Property  0.2  we 
have  A,J(I(tj))  >  A,J(I(t2!).  t2  =>  t3  implies  A,J(I(t2))  >  I(t3). 
Therefore  {I(t  0)  >  I(t3).  Therefore  t,  =>  t3. 

(3)  Ti  *  Tk  =  Tj  Two  cases  are  considered: 

(3.1)  T,  t>  Tj.  Then  t2  =>  t3  implies  I(t2)  >  I(t3).  By  Property  0.2  we 
have  A j 1 ( I (t  2) )  i  Aj’(I(t3)).  t,  =>  t2  implies  I ( 1 1 )  >  Aj’(I(t2)). 
Therefore  I(t,)  >  Aj’(I(t3)).  Therefore  t,  =>  t3. 

(3.2)  Tj  t>  T,.  Then  t2  =>  t3  implies  I(t2)  >  I(t3).  t,  =>  t2  implies  A, 
(I(t,))  >  I  ( 1 2 )  -  Therefore  A^JKtt))  >  I(t3).  Therefore  t,  =>  t3. 

(4)  T,  =  Tj  ¥  T k .  Two  cases  are  considered: 

(4.1)  T,  t>  Tk.  Then  t,  =>  t2  implies  I ( t , )  >  Ak’(I(t2)).  t2  =>  t3 

implies  Ak’(I(t2))  >  I(t3).  Therefore  I ( t  j )  >  I(t3).  Therefore  t,  => 
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(4,2)  Tk  t>  T,.  Then  t,  =>  t2  implies  A.^Kt,))  >  I  (t  2>  -  t2  =>  t3 

implies  I(t2)>  A,k(I(t3)).  Therefore  A,  k  (I  (t,))  >A,kd(t3)).  By 
Property  0.2  we  have  I(t,)  >  I(t3).  Therefore  t,  =>  t3. 

(5)  T,  #  Tk  *  T j ,  T,  *  Tj.  Six  cases  are  considered: 

(5.1)  Tj  t>  Tk  t>  T(.  Then  tt  =>  t2  implies  A^dd,))  >  I(t2).  From 
Property  0.1  and  0.2  we  have  A,Jd(t,))  =  Ak  J  (A  ,  k  (I  (t , ) ) )  i  AkJd(t2)). 

Therefore  A,J(I(t,))  ±  AkJ(I(t2)).  t2  =>  t3  implies  AkJ(I(t2))  > 

I (t 3) .  Therefore  A,J(I(ci))  >  I(t3).  Therefore  t,  =>  t3. 

(5.2)  T,  t>  Tk  t>  Tj.  Then  t2  =>  t3  implies  I(t2)^  A,k(I(t3)).  From 
Property  0.1  and  0.2  we  have  Ak1(I(t2))  -  Ak’(Ajk(I(t3)))  =  Aj'(I(t3)). 

tj  =>  t2  implies  I(ti))  5  Ak’(i(t2)).  Therefore  I(t,)  i  A;  1  (I  (t,)). 
Therefore  t,  =>  t3. 

(5.3)  Tj  t>  t,  t>  Tk.  Then  t,  =>  t2  implies  I(t,)J  Ak’(I(t2)).  From. 
Property  0.1  and  0.2  we  have  A,J(I(ti))  i  A  , J  (Ak  1  (1  (t  2)  )  )  =  AkJ(I(t:))t 

t2  =>  t3  implies  AkJ(I(t2))  >  I(t3).  Therefore  A,Jd(ti))  >  1 
Therefore  t  -  =>  t3. 

(5.4)  Tk  t>  T,  t>  Tj.  Then  t,  ->  t2  implies  A1k(I(t1))  >  I  ( 1 2 ) .  t2  =>  ta 
implies  I(t2d  Ajk(l(t3)).  From  Property  0.1  and  0.2  we  have  A,k 

(I (t  i) '  >  A  j  k (I (t  3) )  =  A i k (A j  1  (I (t  3) ) )  Therefore  1(1,)  >  Aj’(I(t3)). 
Therefore  t ,  =>  1 3. 

(5.5)  Tk  ♦>  Tj  t>  T,.  Then  t,  =>  t2  implies  A,k(I(tn))  >  I(t2).  t2  =>  tj 
implies  I(t2d  Ajk(I(t3)).  Therefore  A,k(I(t,))  >  Ajk(I(t3)). 

However,  Aik(I(t1))  =  A  }  k  (A  , J  ( I  (t , ) ) )  .  Therefore  A,jd(t,))  >  I(t3). 


Therefore  t,  =>  t3. 
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(5.6)  T,  t>  Tj  t>  Tk.  Then  t,  =>  t2  implies  I(t,)>  Ak1(I(t2)).  But  Ak’ 
(I ( 1 2 ) )  =  Aj  1  (Ak j (I (t 2) ) ) .  And  t2  =>  t3  implies  AkJ(I(t2))  >  I(t3). 
Therefore  Ak  ’  (I  (t2) )  >  A  1  ( 1  ( 1 3) ) .  Therefore  Kt,)  >  AJ1(I(t3)). 
Therefore  t ,  =>  t3. 

In  each  group,  we  have  permutated  the  order  of  levels  among  the  distinct 
transaction  classes  to  arrive  at  a  total  13  cases.  These  cases  exhaust  all 
the  possible  situations  that  govern  t1(  t2  and  t3  and  for  every  situation 
transitivity  is  shown  to  hold.  Therefore  we  conclude  that  =>  is 
critical-path  transitive. 

(II)  PROOF  OF  THEOREM  1 

In  order  to  prove  Theorem  1,  we  first  give  the  following  two  definitions 
and  a  lemma  about  the  transaction  dependency  graph. 

Def i ni t i on .  A  critical  path  dependency ,  between  two  distinct  transactions 
t|  €  T,  and  t2  e  Tj,  denoted  as  CD(t1r  t2),is  a  cycle-free  dependency  path 
from  t,  to  t2  in  TG(S(TU))  and  T,  and  T_,  are  on  one  critical  path  in  THG ,  i 
and  j  not  necessarily  distinct. 

Definition,  a  boundary  critical  path  dependency  m  tg(s(tu))  between  two 
transactions  t,  e  T,  and  t2  £  T:,  where  t,  *  t2,  denoted  as  BCD(t1,  t2),is  a 
CD(tj,  t2)  such  that  either  or  both  of  the  following  are  true: 
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1.  There  exists  t3  £  Tk  such  that  t,  -  t3  £CD(t,,t2)  and  T,,  T0  and  Tk 
are  not  on  one  critical  path; 

2.  There  exists  t4  £  Ti  such  that  t4  -  t2  £CD(t,,t2)  and  T,,  T_,  and 
are  not  on  one  critical  path. 

Property.  If  BCD(t1r  t4),  where  t,  £  T,  and  t4  t  T,,  then  there  exist  t2 
£  Tk  and  t3  e  T1(  t2,  t3  not  necessarily  distinct,  such  that  CD(t1(  t2)  C 
CD  (t  i ,  t4),  CD (t  2 , t  3)  C  CD  (t  •( ,  t4),  CD  (t  3 , 1 4)  C  CD  (t  i ,  t4)  and  T  , ,  T  j ,  TK 
and  Ti  are  on  one  critical  path  in  THG.  (This  directly  follows  froir  the  fac 
that  THG  is  a  transitive  semi-tree.) 

Lemma  1 .  If  there  exists  a  critical  path  dependency  CD(t,,t2)  in  a  trans 
action  dependency  graph  TG(S(T))  where  the  schedule  S  enforces  the  partition 
synchronization  rule,  then  ti  =>  t2. 

Proof.  Let  l  he  the  length  (in  number  of  arcs,  i.e.,  direct 
dependencies)  of  a  critical  path  dependency.  Then  l  has  a  total  order  and 
is  bounded  from  below  by  l.  By  way  of  complete  mathematical  induction,  to 
prove  that  if  CD(t1rt2)  then  t,  =>  t2,  we  have  to  show  the  following: 

(1)  If  £ (CD (t  i , 1 2) )  =  1  then  t,  =>  t2. 

(2)  If  £(CD(t1#t2))  =  g  and  if  tB  =>  tb  for  ail  ta,  tb  s.t.  there  exists 
CD (t a , t b)  and  £ (CD(ta. tb) )  <  g,  then  t,  =>  t2. 

Now  we  prove  the  above  two  statements. 

(1)  In  this  case,  CD(t1(t2)=  tf  -  t2.  By  property  1.3  we  have  t,  =>  t-. 
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(2)  To  prove  the  second  statement,  let  t3  t  Tk  and  t4  s  T|  be  such  that 
ti  -  t3  £  CD(t1f  t2),  t4  -  t2  £  CD ( t  t ,  t2),  and  a  path,  denoted  as 
Path(t3,t4),  from  t3  to  t4  such  that  Path(t3,t4)  C  CD(t,,t2).  Also  let 
t1  £  Ti  and  t2  £  Tj.  Consider  the  following  two  cases: 

(2.1)  If  CD (t  i , 1 2 )  is  not  a  BCD,  then  Path(t3,t4)  is  a  CD(t3,t4) . 

Since  £ (CD (t  i , t2) )  <  g  therefore  t,  =>  t2.  And  by  the  definition  of 
CD,  T,,  Tj,  Tk  and  Tj  must  be  on  one  critical  path  of  THG .  Therefore 
we  have  t,  -  t3,  t4  -  t2  and  t3  =>  t4.  By  property  1.2  (i.e.,  the 
property  of  critical  path  transitivity)  we  have  ti  =>  t2. 

(2.2)  If  CD(t1ft2)  -Li  a  BCD,  then  by  the  property  above  of  a  BCD  we 
have  that  there  exist  t5  £  Tm  and  t6  £  Tn  such  that  CD(t1ft5)  C  CD(ti 

t2),  CD ( 1 5 , t  g)  C  CD (t  t , 1 2) ,  and  CD(t6,t2)  C  CD(t1(t2),  where  Tm,  Tn, 
T,  and  Tj  are  on  one  critical  path  of  THG.  Since  l  {CD  It  u  t5))< 
g,  therefore  ti  =>  t5.  Similarly,  t6  =>  t2  and  t5  =>  t6.  By  proper¬ 
ty  1.2  we  conclude  ti  =>  t2.  Q.E.D. 

Theorem  1 . 

Proof .  Suppose  there  exists  a  cycle.  Then  the  cycle  involves  at  least  two 
transactions  ti  and  t2  that  belong  to  transactions  that  are  on  one  critical 
path.  This  means  that  there  exist  CD(t1rt2)  and  CD(t2,ti).  By  the  above 
lemma,  CD(ti,t2)  implies  ti  =>  t2  and  CD(t2,ti)  implies  t2  =>  ti.  However,  => 
is  anti-symmetric  (by  property  1.1).  Therefore  ti  =>  t2  and  t2  =>  ti  cannot 
be  true  at  the  same  time.  Therefore  there  can  be  no  cycle  in  this  transaction 
dependency  graph.  Q.E.D. 


INFOPLEX  Technical  Report  #12 


(III)  PROOF  OF  PROPERTY  2.1 

Property  2.1.  A,J(Bj'(m))  >  m,  where  T,  -  T ,  , - ...  -  T,(n-u  -  T,n  -  T, 

=  CP , J  in  the  transaction  hierarchy  graph. 

Proof.  A,J(B3'(m;,  =  A , j (C , , (... (C , n(Cj (m) ))...)) .  (C3  is  an  abbreviated 
expression  for  C3lateand  I3  is  an  abbreviated  expression  for  I3old.)  Let  m,  = 
Cj(m).  Then  m3  =  C(t30)  if  there  exists  t  eT3  active  at  time  m  and  C3  (ir.)  =  cftj0), 
and  m.  =  rr,  if  there  exists  no  t  e  T3  active  at  time  m.  Therefore  A,  MB.1 
(m))  =  A  , J  (C  ,,(...  (C  ,  n(ir.j ))...))  .  Continue  substitution  of  the  L  function  in 
the  expression  with  similarly  defined  min,  ...,  mn,  we  get  A,J(B3’(m))  =  A,] 
(mM).  Now  we  start  spelling  out  the  function  A  , J :  A,j(m,.,)  =  A  ,  , j  ( I  ,  ,  (m  ,  1  )). 

Consider  the  following  two  cases: 

(1)  If  there  exists  no  t  e  T,i  active  at  m,1f  then  Ii1(mi1)=  m ,  , .  Since  ic,, 

=  C,  ,(111,2)  *  mi2'  we  have  Iii(u,i)i  ml2. 

(2)  If  there  exists  t  «  T , ,  active  at  mi1f  then  I,1(m,1)=  I(ti1,)>  where  t,/ 

*  t,,°  (since  t ,  ,  ’  is  active  at  m ,  ,  while  t,,0  commits  at  m ,  , ) ,  and 

I(t,r)  i  15,2  (since  if  Kt,,1)  <  m,2  then  during  the  previous  applica¬ 
tion  of  C ,  1 ,  C,i(m,2)  should  be  equal  to  C(t,1')  and  not  C(t,1°),  and 
contradicts  the  assumption.)  Therefore  I,1(m,1)2  m,2. 

Therefor  we  conclude  In(rn,,)>  m,2.  By  the  same  reasoning  we  continue  spell¬ 
ing  out  the  A  function  to  arrive  at  the  following:  A,J(Bj'(m))  =  A  ,  nJ ( I , n (m , r  )), 
Since  I,n(min)  -  =  Cj(m),  we  have  A,J(BJ1(m))  >  A ,  nJ  (C3  (m)  )  .  Since 

A ,  n  J  (C  j  (m) )  =  Ij(Cj(m))  £  m,  we  have  A^tBj’fm))  >  m.  Q.E.D. 
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(IV)  PROOF  OF  PROPERTY  2.2 

Property  2.2.  A , J (Bj 1 (m)  -  e)  <  m,  where  T,  -  T , ,  -  -  ...  -  T, , n. ,  ,  -  T , r 
-  Tj  =  CP i j  in  the  transaction  hierarchy  graph,  and  e  a  small  value. 

Proof .  Let  irij,  m)n,  _ m,  be  defined  in  the  same  way  as  in  the  proof  cf 

Property  2.1.  We  have  A1J(BJ’(m)-£)  =  A,J(m,  ,  -  e)  =  A,1-’(I,i(m11  -  «)).  Nov: 

we  show  that  -  c)  <  m,2.  Consider  the  following  two  cases: 

(1)  If  there  exists  no  t  t  T, i  active  at  ml2,  then  m,!  =  Cii(m,2)  =  m,2. 

Therefore  I,  ,(mn  -  e)  =  I,i(m,2  -  e)  <  m,2  -  e  <  ml2. 

(2)  If  there  exists  t  e  Tt1  active  at  m,2,  then  m,i  =  Cj^m^)  =  C(t(i°) 

where  Ift,,0)  <  m,2.  Therefore  I, ,(111,1  -  e)  =  I^tClt,  ,°)~  t)  £  Itt,,0^ 

<  m  i  2- 

Therefore  we  conclude  I,1(m,1  -  e)  <  m,2.  Let  m,,'  =  I , ^  (m,,  -  e) .  Then  m  ,  . 

<  m,2,  and  A^B/fm)  -  e)  =  A,1J(m,i').  Continue  the  process  of  substi¬ 
tution  we  have  A,j(Bj’(m)  -  e)  =  AlnJ(m,n’)  =  Ij(m,n')  where  m,n'  <  But 

Ij(min')  <  Ij(mj  -  £)  =  Ij(Cj(m)  -  e)  <  m.  Therefore  Aj^B/tm)  -  e)  <  m. 

O.E.D. 

(V)  PROOF  OF  LEKKA  2.1 

Lemma  2.1 .  Let  Tk,  T,  and  T j  be  transaction  classes  in  a  TH3  cf  a  legal 
database  partition,  and  Tj  and  Tj  are  on  one  critical  path.  Then  for  any  time 
value  m,  if  I  ( t , )  <  Ek'(m)  and  I(t2)>  Ekj(m)  then  there  exists  no  t,  -  t2  in 
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W 


the  transaction  dependency  graph  TG(S(TU))  where  the  schedule  S  enforces  the 
PSR ' s . 

Proof.  Let  T  k i  be  the  class  such  that  kl  is  the  fisrt  index  in  UCP„  ’  where 
Tk1  and  T,,  T,  are  on  one  critical  path.  (k  and  kl  are  not  necessarily 
distict.)  Then  kl  will  also  be  the  first  such  index  in  UCPkJ,  and  the  subset 
of  the  ordered  set  UCPk’  up  to  kl  and  that  of  UCPkJ  up  to  kl  are  equivalent. 


m 

(This  is 

i 

L  . 

Consider 

L  . 

(1) 

sr 

In  this  case,  Ek’(m)  =  EkJ(m).  Since  t,  and  t2  are  in  the  same 
class,  by  intra-class-synchronization  rule  we  have  l(t1)<  I(t2), 
which  implies  that  there  exists  no  t,  -  1 2 • 

(2)  i  =  kl  *  j.  Two  cases  are  considered: 

(2.1)  T,  t>  Tj. 

I(t2)  >  Ek 3 (m)  implies  that  AJ,(I(t2))  >  1  (EkJ (m) )  =  Ajk1(Ek1J 

(Ekk,(m))).  From  Property  2.1  we  have  h3 k  1  (Bk  , J (Ekk 1 (m) ) )  5 
Ekk1(m)  =  Ek’(m)  >  lit,).  Therefore  AJ1(I(t2))  >  IftO,  which 
implies  that  there  exists  no  t,  -  t2. 

(2.2)  Tj  f>  T,. 

I  (t ,)  <  Ek  1  (m)  implies  A,j(I(t,))  <  A,J(Ek’(ir,))  =  EkJ(m)<  I(t2). 

Therefore  A,J(I(t1))  <  I(t2),  which  implies  that  there 
exists  no  tfl  -  1 2. 

(3)  j  =  kl  *  i.  Two  cases  are  considered: 


(3.1)  T,  f>  T,. 
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I(t2)  2  Ek J (m)  implies  that  Aj’(I(t2))  *  Aj ’ (EkJ (m) )  =  Ek 1 (m) > 
Kt,)-  Therefore  Aj'(l(t2))  >  I  (t  ()  ,  which  implies  that  there 
exists  no  t,  -  1 2 . 

(3.2)  Tj  t>  T,. 

I(t,)  <  Ek'(m)  implies  I(t,)S  Ek 1 (m)  -  e,  which  implies  A , J 
(I(t1))  S  A , j  (B  j  '  (EkJ'  (m) )  -  f).  From  property  2.2  we  have  A,J 
(BJ'(EkJ(m))  -  e)  <  EkJ (m) .  Since  I(t2)  >  EkJ(m),  therefore  A,j 
(I(t,))  <  I(t2i,  which  implies  that  there  exists  no  t 1  -  t2. 

(4)  i  #  j  #  kl.  Six  cases  are  considered: 

(4.x)  T ,  t>  Tk1  t>  Tj. 

I  (t  2)  >  EkJ(m)  implies  that  AJi(I(t2))  2  A  j  1  (EkJ  (m) )  =  A/(Bk1J 
(Ek  k  1  (m) ) )  =  Aki'(Ajk1(Bk1J(Ekk1(m)))).  By  property  2.1,  Ak  .  1 
(Ajk1(Bk1j(Ekk1(m)))  >  Ak1’(Ekk1(m))  =  EkMm).  Since  Ek’(m)  > 
I(t,) ,  we  have  Aj’(I(t2))  >  I(t1),  which  implies  that  there 
exists  no  t ,  -  t2. 

(4.2)  Tj  t>  Tk i  ♦>  T i . 

I(t,)  <  Ek’(m)  implies  I(t,)  <  Ek’(m)-  e,  which  implies  A,j 

(I(t,))  <  A,J(Eki(m)  >  e)  =  Ak , J (A  * k 1 (Bk , 1 (Ek k 1 (m) )  -  e)).  Let 
m'  =  A  ,  k  1  (Bk  1 1  (Ekk  1  (m) )  -  e).  By  property  2.2  we  have  m'  <  E^'cm). 

Therefore  Ak  , j (A  , k  1  (Bk  , 1 (Ekk 1 (m) )  -  e))  =  Ak1J(m')  < 

Ak ! i (Ekk 1 (m) )  =  EkJ(m)  <  I(t2).  Therefore  A,J(I(t1))  <  I(t2) 
which  implies  there  exists  no  t,  -  t2. 

(4.3)  Tk1  >  T,  >  Tj. 


t;- 
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I(t2)  >  EkJ(m)  implies  Aj'(I(t2))  >  Aj'(EkJ(m))  = 

(m) ) )  i  Ek'(m)  >  I(t,).  Therefore  AJ1(I(t2))  >  Ift,),  which 
means  that  there  exists  no  t,  -  t2. 

(4.4)  T j  t>  t,  t>  Tk1. 

I(t,)  <  Ek'(m)  implies  A,J(I(t,))  <  A,J(Ek'(ni))  =A  , J  (Ak  (Ek  *■  1 
(m))  =  Ak  i J (Ekk 1 (m) )  =EkJ(m)  <  I(t2).  That  is,  A1J(I(t1))  < 

I(t2)  which  means  there  exists  no  t(  -  t2. 

(4.5)  Tk1  +>  Tj  t>  T,. 

I(t1)  <  Ek'(m)  implies  I(t,)i  E k  1  ( m )  -  e,  which  means  A,;(I(t-)) 
<  A1J(Ek’(m)  -  e)  =  A , J (B 3  ' (Ek J (m) )  -  e)  <  EkJ(m)  <  I(t2). 
That  is,  Ai-i(I(t1))  <  I(t2),  which  means  that  there  exists  nc 
1 1  -  t2. 

(4.6)  T,  t>  Tj  t>  Tk1. 

I(t2)  >  EkJ (m)  implies  A  j  1  ( I  (t  2) )  >  Aj'(EkJ(m))  =  A }  '  (Ak  - : (Ek k 1 
(m)  )  =  Ek'(m)>  I  (t,) .  That  is  Aj'(I(t2))  >  Kt,),  which  means 

that  there  exists  no  t,  -  t2. 

For  each  of  the  group  above  we  have  permutated  the  level  of  the  dis¬ 
tinct  classes  and  for  a  total  of  11  cases  we  have  show’n  that  it  is 
impossible  to  have  t,  -  t2.  Therefore  we  prove  that  there  exists  no 
t)  -  t2.  0  E .D . 

(VI)  PROOF  OF  THEOREM  2 
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In  order  to  prove  Theorem  2,  we  first  give  the  following  definitions  and  a 
lemma  (Lemma  2.2.) 

Def i ni ti on .  a  consistent  transaction  set  with  respect  to  a  schedule  S(T I , 

abbreviated  as  a  CS  w.r.t.  S(T),  is  a  set  of  transactions  Tcs  C  T  such  that  if 
t  £  Tcs  and  if  there  exists  t,  £  T  such  that  t  -  ...  -  t,  C  TG(S(T)),  (i.e., 
if  t  depends  on  t,  in  the  transitive  closure  of  -) ,  then  t,  t  Tcs. 

Property  2.3.  ( The  Property  of  a  consistent  transaction  set.)  Partition  tu 
into  Tu 1  and  Tu2.  Then  Tu1  is  a  consistent  transaction  set  w.r.t  S(T)  iff  for 
any  two  transactions  t,,t2,  such  that  t,  £  Tu1  and  t2  £  Tu2,  there  exists  nc 
t1  -  t2  in  the  transaction  dependency  graph  TG(S(T)). 

Proof .  We  want  to  show  that  the  following  two  parts  are  true: 

(1)  If  Tu1  is  a  CS  then  there  exists  no  t1  -  t2. 

By  definition  of  a  CS,  if  tt  e  Tu1  and  t,  -  t2,  then  t2  must  be  also 
in  Tu1,  which  contradicts  the  given.  Therefore  there  exists  no  t ,  - 
1 2* 

(2)  If  there  exists  no  t,  -  t2  for  any  t,  e  Tu1  and  t2  f  Tu2,  then  Tu'  is 
a  CS. 

Tu 1  is  a  CS  because  no  transaction  in  Tu1  can  have  a  dependency  in  the 
transitive  closure  on  a  transaction  which  is  not  in  Tu1. 

Therefore  we  conclude  that  this  property  is  true.  Q.E.O. 


c;a 
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Def i ni ti on .  Given  a  time  value  m  and  a  Starting  transaction  class  Ts,  a 
designated  consistent  transaction  set,  denoted  as  Tcs(m,s),  is  a  consis-ent 
transaction  set  such  that  for  all  t  t  Ts,  t  e  Tcs(m,s)  iff  I(t)  <  m. 

Lemma  2.2.  parition  Tu  into  Tu1  and  Tu2.  Then  Tu1  is  the  designated  con¬ 
sistent  transaction  set  Tcs(m,s)  w.r.t.  S(TU),  where  the  schedule  S  enforces 
the  PSR ..  if  Tu  1  contains,  for  all  i,  all  and  only  transactions  t  such  that 
I(t)  <  Es’(m)  where  t  6  .T , . 

Proof  .  Construct  a  time  wall  TW(m,s).  Then  by  the  previous  lemma  (Lem::, a 
2.1)  we  know  that  for  any  ; ,  k,  if  t,  f  Tj  and  I(ti)  <  EsJ(m),and  t2  £  Tk  and 
I ( 1 2 )  -  Esk(m)  then  there  exists  no  1 1  -  t2-  Therefore  by  Property  2.3  above 
we  know  that  Tu1  is  a  consistent  transaction  set  if  it  contains  lor  ail  i  only 
transactions  t  such  that  i(t)  <  Es'(m)  where  t  e  T,.  And  since  Ess(m)  =  m,  we 
have  I(ti)  <  m  if  t ,  e  Ts.  Therefore  Tu'  must  be  the  designated  consistent 
transaction  set  Tcs(m,s).  Q.E.D. 

Corollary.  Given  a  time  value  m  and  a  starting  transaction  class  Ts,  there 
exists  a  designated  consistent  transaction  set  Tcs(m,s). 

Theorem  2. 

Proof.  Partition  Tu  into  Tu’  and  Tu2  such  that  for  all  t  e  T . ,  for  a:: 
t  c  Tu  1  iff  I  (t)  <  Es’(m).  Then  it  is  clear  that  dependencies  induced  by 
must  be  arcs  that  gc  from  t  p  to  transactions  :n  Tu  1 ,  and  arc?  from  » ran: - 
actions  in  Tu2  to  tR.  By  Lemma  2.2,  there  exist  no  dependencies  from  t  ran. - 
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actions  in  Tul  to  those  in  Tu2.  Therefore  arcs  introduced  by  tR  will  net 
introduce  any  cycle  into  the  original  TG(S(TU)).  Since  TG(S(TU))  has  nc 
cycle,  therefore  TG(s(Tu  U  tR))  has  nc  cycle.  Q.E.D. 


